Abstract



Robbins' problem of optimal stopping asks one to minimise the expected rank
of the observation chosen by some nonanticipating stopping rule. We settle a
conjecture regarding the value of the stopped variable under the rule optimal in the
sense of the rank, by embedding the problem in a much more general context
of selection problems with the nonanticipation constraint lifted, and with the
payoff growing like a power function of the rank.

1. Let $X_1, \ldots, X_n$ be independent random variables sampled sequentially from the
uniform $[0, 1]$ distribution, and let $Y_1 < \cdots < Y_n$ be their order statistics. The rank
$R_j$ of the variable $X_j$ is defined by setting $R_j = k$ on the event $X_j = Y_k$. Robbins'
problem of optimal stopping [3] asks one to minimise the expected rank $\mathbb{E}R_\tau$ over
all stopping times $\tau$ that assume values in $\{1, \ldots, n\}$ and are adapted to the natural
filtration of the sequence $X_1, \ldots, X_n$. Let $\tau_n$ be the optimal stopping time. The
minimum expected rank $\mathbb{E}R_{\tau_n}$ increases as $n$ grows, and converges to some finite
limit $v$ whose exact value is unknown. The closest known upper bound is slightly
less than 7/3. Finding $v$ or even improving the existing rough bounds remains a
challenge. A major source of difficulties is that the optimal stopping time $\tau_n$ is a
very complicated function of the sample. It seems that $\tau_n$ has not been computed
for $n > 3$. Moreover, for large $n$ there is no simplification, and the complexity of the
optimal stopping time persists in the '$n = \infty$' limiting form of the problem [6].
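The objects in this setup (the rank $R_\tau$, a nonanticipating rule $\tau$, and the stopped value $X_\tau$) can be made concrete with a small simulation. The sketch below is only an illustration: it uses a simple memoryless threshold rule, which is not the optimal $\tau_n$, and the function names, the threshold form $c/(n-k+c)$ and the parameter `c` are assumptions introduced here.

```python
import random


def stopped_rank_and_value(n, rng, c=2.0):
    """Run one sample of size n under a memoryless threshold rule.

    Illustrative rule (an assumption, not the optimal tau_n): stop at the
    first k with X_k <= c / (n - k + c). At k = n the threshold equals 1,
    so the rule always stops by the last observation.
    Returns (rank of the stopped observation, stopped value).
    """
    xs = [rng.random() for _ in range(n)]
    tau = n
    for k, x in enumerate(xs, start=1):
        # The decision at step k uses only X_k: the rule is nonanticipating.
        if x <= c / (n - k + c):
            tau = k
            break
    x_tau = xs[tau - 1]
    # Rank 1 is the smallest value in the whole sample.
    rank = sum(1 for x in xs if x <= x_tau)
    return rank, x_tau


def estimate(n, trials, seed=0, c=2.0):
    """Monte Carlo estimates of E[R_tau] and n * E[X_tau] for the rule above."""
    rng = random.Random(seed)
    total_rank = 0
    total_value = 0.0
    for _ in range(trials):
        rank, value = stopped_rank_and_value(n, rng, c)
        total_rank += rank
        total_value += value
    return total_rank / trials, n * total_value / trials


if __name__ == "__main__":
    mean_rank, scaled_value = estimate(n=50, trials=2000)
    print(mean_rank, scaled_value)
```

The second returned quantity is the scaled stopped value $n\,\mathbb{E}X_\tau$ for this particular rule only; the simulation says nothing about the optimal rule.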

In a recent paper Bruss and Swan [1] stressed that it is not even known if
$\limsup_n n\,\mathbb{E}X_{\tau_n}$ is finite. They mentioned that the property was first conjectured
in [2]. While the conjecture stems from attempts to bound $v$ by comparison
with the much simpler problem of minimising $\mathbb{E}X_\tau$ (or minor variations of the
problem), it seems that the question is of independent interest as a relation between the
stopped sample value and its rank. In this note we settle the conjecture by proving
a considerably more general assertion:



*Postal address: Department of Mathematics, Utrecht University, Postbus 80010, 3508 TA 
Utrecht, The Netherlands. E-mail address: A.V.Gnedin@uu.nl 

^Postal address: Faculty of Cybernetics, National T. Shevchenko University of Kiev, Kiev-01033, 
Ukraine. E-mail address: iksan72@mail.ru 



Proposition 1. Fix $p > 0$. For $n = 1, 2, \ldots$ let $\sigma_n$ be a random variable with
range $\{1, \ldots, n\}$ and arbitrary joint distribution with $X_1, \ldots, X_n$. Then

$$\limsup_n \mathbb{E}[R_{\sigma_n}]^p < \infty \quad\text{implies}\quad \limsup_n n^p\,\mathbb{E}[X_{\sigma_n}]^p < \infty. \tag{1}$$

In particular, $\limsup_{n\to\infty} n^p\,\mathbb{E}[X_{\tau_n}]^p < \infty$ for $\tau_n$ the stopping time minimising $\mathbb{E}[R_\tau]^p$ over
all stopping times adapted to $X_1, \ldots, X_n$.

The idea is to bound $X_{\sigma_n}$ by exploiting properties of a random walk with negative
drift.

2. Let $\xi, \xi_1, \xi_2, \ldots$ be iid nonnegative random variables with $\mu = \mathbb{E}\xi \in (0, \infty)$. Let
$S_k := \xi_1 + \cdots + \xi_k$ and for $\lambda > \mu$ let $M_\lambda := \sup_{k \ge 0}(S_k - \lambda k)$.
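Since $\lambda > \mu$, the walk $S_k - \lambda k$ has negative drift and its supremum is almost surely finite. A minimal sketch of how $M_\lambda$ could be approximated numerically is given below; the exponential step distribution, the finite truncation `horizon` and the function name are assumptions made for illustration, and truncation yields a lower bound on $M_\lambda$ rather than its exact value.

```python
import random


def sup_of_drifted_walk(lam, horizon, rng, mean=1.0):
    """Approximate M_lambda = sup_{k >= 0} (S_k - lam * k) over k <= horizon.

    Steps xi_k are exponential with the given mean (an illustrative choice).
    The k = 0 term contributes S_0 - 0 = 0, so the result is nonnegative.
    """
    s = 0.0
    best = 0.0
    for k in range(1, horizon + 1):
        s += rng.expovariate(1.0 / mean)  # expovariate takes the rate 1/mean
        best = max(best, s - lam * k)
    return best


if __name__ == "__main__":
    rng = random.Random(0)
    samples = [sup_of_drifted_walk(1.5, 2000, rng) for _ in range(1000)]
    print(sum(samples) / len(samples))  # crude estimate of E[M_lambda], mu = 1
```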

Proposition 2. For $p > 0$,

$$\mathbb{E}\xi^{p+1} < \infty \iff \mathbb{E}M_\lambda^p < \infty.$$

Proof. The moment condition on $\xi$ is equivalent to $\mathbb{E}[(\xi - \lambda)^+]^{p+1} < \infty$, and the
result follows from Lemma 3.5 in [^. $\Box$

Corollary 3. Suppose $\mathbb{E}\xi^{p+1} < \infty$ and let $\sigma$ be a nonnegative integer random
variable with $\mathbb{E}\sigma^p < \infty$. Then $\mathbb{E}S_\sigma^p < \infty$.

Proof. This follows from $S_\sigma^p \le (M_\lambda + \lambda\sigma)^p \le c_p (M_\lambda^p + \lambda^p \sigma^p)$, where $c_p := 2^{p-1} \vee 1$. $\Box$
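The two inequalities in the proof of Corollary 3 rest on a pathwise bound and an elementary power inequality. A sketch of the standard argument is given below, where $a, b \ge 0$ are generic numbers (notation introduced here only for this aside); note that no independence between $\sigma$ and the $\xi_k$ is needed.

```latex
\begin{align*}
% pathwise: S_k - \lambda k \le M_\lambda for every k \ge 0, in particular at k = \sigma
S_\sigma &\le M_\lambda + \lambda\sigma, \\
% convexity of x \mapsto x^p for p \ge 1, subadditivity for 0 < p < 1
(a+b)^p &\le
\begin{cases}
2^{p-1}\,(a^p + b^p), & p \ge 1, \\
a^p + b^p, & 0 < p < 1,
\end{cases} \\
% combine with a = M_\lambda, b = \lambda\sigma and take expectations
\mathbb{E} S_\sigma^p &\le c_p \bigl( \mathbb{E} M_\lambda^p + \lambda^p\, \mathbb{E}\sigma^p \bigr) < \infty ,
\end{align*}
```

where the finiteness of $\mathbb{E}M_\lambda^p$ comes from Proposition 2.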

3. We can apply Corollary 3 to a Poisson-embedded, limiting form of the stopping
problem with continuous time [6]. Let $\xi_1, \xi_2, \ldots$ be iid rate-one exponential variables,
$S_k$ as above, and let $T_1, T_2, \ldots$ be iid uniform $[0, 1]$ random times, independent of the
$\xi_j$'s. The points $(T_k, S_k)$ are the atoms of a homogeneous planar Poisson process $\mathcal{P}$
in $[0, 1] \times [0, \infty)$. To introduce the dynamics, consider an observer whose information
at time $t \in [0, 1]$ is the (infinite) configuration of points of $\mathcal{P}$ within the strip $[0, t] \times
[0, \infty)$, that is $\{(T_k, S_k) : T_k \le t\}$. The rank of point $(T_k, S_k)$ is defined as $R_{T_k} = k$,
meaning that $S_k$ is the $k$th smallest value among $S_1, S_2, \ldots$. The piece of information
added at time $T_k$ is the point $(T_k, S_k)$, but not the rank $R_{T_k}$.
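The construction of the atoms $(T_k, S_k)$ can be sketched in a few lines. The code below generates only the lowest few atoms, which is enough to read off their ranks exactly; the function names and the notion of a "relative rank on arrival" (the rank among atoms seen so far) are illustrative assumptions introduced here to show why the observer cannot read $R_{T_k}$ from the information available at time $T_k$.

```python
import random


def lowest_atoms(num_atoms, rng):
    """Return the num_atoms lowest atoms (T_k, S_k), listed by increasing height."""
    atoms = []
    height = 0.0
    for _ in range(num_atoms):
        height += rng.expovariate(1.0)         # S_k = S_{k-1} + xi_k, rate-one gaps
        atoms.append((rng.random(), height))   # T_k uniform on [0, 1]
    return atoms


def visible_at(atoms, t):
    """Atoms with T_k <= t: the observer's information at time t (truncated)."""
    return [(tk, sk) for tk, sk in atoms if tk <= t]


def relative_rank_on_arrival(atoms, k):
    """Rank of the k-th lowest atom among atoms that arrived no later than T_k."""
    t_k, s_k = atoms[k - 1]
    return sum(1 for tj, sj in atoms if tj <= t_k and sj <= s_k)


if __name__ == "__main__":
    atoms = lowest_atoms(10, random.Random(0))
    for k in range(1, 6):
        # The absolute rank is k; the relative rank can be smaller because
        # lower atoms may arrive later.
        print(k, relative_rank_on_arrival(atoms, k))
```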

Suppose the objective of the observer is to minimise $\mathbb{E}[R_\tau]^p$ over stopping times $\tau$
that assume values in the random set $\{T_1, T_2, \ldots\}$ and are adapted to the information
flow of the observer. For the optimal stopping time $\tau_\infty$ it is known from the previous
studies that $\mathbb{E}[R_{\tau_\infty}]^p < \infty$ (see ^ and [5]). Taking $\sigma = R_{\tau_\infty}$ in Corollary 3, we have $\mathbb{E}[S_\sigma]^p < \infty$.
The case $p = 1$ corresponds to the infinite version of Robbins' problem of minimising
the expected rank.

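4. To apply the above to a finite sample, we shall use the familiar realisation of
uniform order statistics through sums of exponential variables, as

$$(Y_k,\ 1 \le k \le n) \stackrel{d}{=} (S_k/S_{n+1},\ 1 \le k \le n).$$

Introducing the event $A_n := \{n/S_{n+1} > 1 + \varepsilon\}$, we can estimate for $1 \le k \le n$

$$n^p Y_k^p = n^p Y_k^p 1_{A_n} + n^p Y_k^p 1_{A_n^c} \le n^p 1_{A_n} + (1 + \varepsilon)^p S_k^p \le n^p 1_{A_n} + c_p (1 + \varepsilon)^p (M_\lambda^p + \lambda^p k^p),$$

where we used $S_k \le M_\lambda + \lambda k$. Using a large deviation bound for the probability of
$A_n$ and sending $\varepsilon \to 0$ we conclude that for any random variable $\sigma_n$ with values in
$\{1, \ldots, n\}$

$$\limsup_n n^p\,\mathbb{E}[Y_{\sigma_n}]^p \le c_p \lambda^p \limsup_n \mathbb{E}\sigma_n^p + c_p\,\mathbb{E}M_\lambda^p.$$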
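The large deviation step can be made explicit with a Chernoff bound. The following is a sketch under stated notation: $S_m$ denotes the normalising sum of $m \ge n$ rate-one exponential variables appearing in the event $A_n$, and $\theta > 0$ is a free parameter (both symbols are introduced here for the aside).

```latex
\begin{align*}
\mathbb{P}(A_n)
  = \mathbb{P}\Bigl( S_m < \tfrac{n}{1+\varepsilon} \Bigr)
  &\le e^{\theta n/(1+\varepsilon)}\, \mathbb{E}\, e^{-\theta S_m}
   = e^{\theta n/(1+\varepsilon)}\, (1+\theta)^{-m} \\
  % choose \theta = \varepsilon and use m \ge n
  &\le \exp\Bigl( -n \Bigl[ \log(1+\varepsilon) - \tfrac{\varepsilon}{1+\varepsilon} \Bigr] \Bigr).
\end{align*}
```

The bracket is strictly positive for $\varepsilon > 0$, so $n^p\,\mathbb{P}(A_n) \to 0$ for every fixed $p$ and $\varepsilon$, and the term $n^p 1_{A_n}$ does not contribute to the upper limit of the expectation.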

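Finally, taking $\sigma_n = R_{\tau_n}$, Proposition 1 follows from

$$\limsup_n n^p\,\mathbb{E}[X_{\tau_n}]^p \le c_p \lambda^p \limsup_n \mathbb{E}[R_{\tau_n}]^p + c_p\,\mathbb{E}M_\lambda^p < \infty,$$

since $\mathbb{E}[R_{\tau_n}]^p$ converges to a finite limit (see [6], [5]).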
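The realisation of uniform order statistics through exponential sums used in this section can be checked numerically. The sketch below assumes the standard normalisation by the sum of $n + 1$ rate-one exponentials and compares the empirical mean of the $k$th order statistic under both constructions with the known value $k/(n+1)$; all function names are hypothetical.

```python
import random


def order_stats_via_exponentials(n, rng):
    """Return (S_1/S_{n+1}, ..., S_n/S_{n+1}) from n + 1 rate-one exponentials."""
    partial_sums = []
    s = 0.0
    for _ in range(n + 1):
        s += rng.expovariate(1.0)
        partial_sums.append(s)
    total = partial_sums[-1]
    return [sk / total for sk in partial_sums[:-1]]


def order_stats_direct(n, rng):
    """Return the sorted values of n independent uniform [0, 1] variables."""
    return sorted(rng.random() for _ in range(n))


def mean_kth(sampler, n, k, trials, seed):
    """Empirical mean of the k-th smallest value produced by `sampler`."""
    rng = random.Random(seed)
    return sum(sampler(n, rng)[k - 1] for _ in range(trials)) / trials


if __name__ == "__main__":
    n, k = 10, 3
    print(mean_kth(order_stats_via_exponentials, n, k, 2000, seed=1))
    print(mean_kth(order_stats_direct, n, k, 2000, seed=2))
    print(k / (n + 1))  # exact mean of the k-th uniform order statistic
```

Agreement of first moments is only a sanity check, not a verification of equality in distribution.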